Dongyeop Kang

UC Berkeley

Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Apr 28, 2025

Stealing Creator's Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation

Apr 26, 2025

LawFlow : Collecting and Simulating Lawyers' Thought Processes

Apr 26, 2025

Learning Explainable Dense Reward Shapes via Bayesian Optimization

Apr 22, 2025

Align to Structure: Aligning Large Language Models with Structural Information

Apr 04, 2025

A Framework for Robust Cognitive Evaluation of LLMs

Apr 03, 2025

Learning a High-quality Robotic Wiping Policy Using Systematic Reward Analysis and Visual-Language Model Based Curriculum

Feb 18, 2025

RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models

Feb 13, 2025

ScholaWrite: A Dataset of End-to-End Scholarly Writing Process

Feb 05, 2025

Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations

Oct 02, 2024